Joint Stochastic Approximation learning of Helmholtz Machines
نویسندگان
چکیده
Though with progress, model learning and performing posterior inference still remains a common challenge for using deep generative models, especially for handling discrete hidden variables. This paper is mainly concerned with algorithms for learning Helmholz machines, which is characterized by pairing the generative model with an auxiliary inference model. A common drawback of previous learning algorithms is that they indirectly optimize some bounds of the targeted marginal log-likelihood. In contrast, we successfully develop a new class of algorithms, based on stochastic approximation (SA) theory of the Robbins-Monro type, to directly optimize the marginal log-likelihood and simultaneously minimize the inclusive KL-divergence. The resulting learning algorithm is thus called joint SA (JSA). Moreover, we construct an effective MCMC operator for JSA. Our results on the MNIST datasets demonstrate that the JSA’s performance is consistently superior to that of competing algorithms like RWS, for learning a range of difficult models.
منابع مشابه
Bayesian Unsupervised Learning
Multilayer architectures such as those used in Bayesian belief networks and Helmholtz machines provide a powerful framework for representing and learning higher order statistical relations among inputs. Because exact probability calculations with these models are often intractable, there is much interest in nding approximate algorithms. We present an algorithm that eeciently discovers higher or...
متن کاملBayesian Unsupervised Learning of Higher Order Structure
Multilayer architectures such as those used in Bayesian belief networks and Helmholtz machines provide a powerful framework for representing and learning higher order statistical relations among inputs. Because exact probability calculations with these models are often intractable, there is much interest in finding approximate algorithms. We present an algorithm that efficiently discovers highe...
متن کاملUsing Stochastic Helmholtz Machine for Text Learning
We present an approach for text analysis, especially for topic words extraction and document classification, based on a probabilistic generative model. Generative models are useful since they can extract the underlying causal structure of data objects. For this model, a stochastic Helmholtz machine is used and it is fitted using the wake-sleep algorithm, a simple stochastic learning algorithm. ...
متن کاملExploration of Mean-Field Approximation for Feed-Forward Networks - Neural Networks, 1999. IJCNN '99. International Joint Conference on
We present a formulation of mean-field approximation for layered feed-forward stochastic networks. In this formulation, one can obtain not only estimates of averages for state variables of the networks but also those of intra-layer correlations, the latter of which cannot be obtained by the conventional mean-jield approximation. Moreovel; this formulation provides a pamework to treat “condition...
متن کاملA Unified Learning Scheme: Bayesian-Kullback Ying-Yang Machines
A Bayesian-Kullback learning scheme, called Ying-Yang Machine, is proposed based on the two complement but equivalent Bayesian representations for joint density and their Kullback divergence. Not only the scheme unifies existing major supervised and unsupervised learnings, including the classical maximum likelihood or least square learning, the maximum information preservation, the EM & em algo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1603.06170 شماره
صفحات -
تاریخ انتشار 2016